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Abstract — This paper investigates the optimum source trans- 
■ mission strategy to maximize the capacity of a multiple-input 
single-output (MISO) amplify-and-forward relay channel, as- 
suming source-relay channel mean feedback at the source. The 
challenge here is that relaying introduces a nonconvex structure 
in the objective function, thereby excluding the possible use of 
previous methods dealing with mean feedback that generally rely 
on the concavity of the objective function. A novel method is 
employed, which divides the feasible set into two subsets and 
establishes the optimum from one of them by comparison. As 
such, the optimization is transformed into the comparison of two 
nonnegative random variables in the Laplace transform order, 
which is one of the important stochastic orders. It turns out that 
the optimum transmission strategy is to transmit along the known 
channel mean and its orthogonal eigenchannels. The condition 
for rank-one precoding (beamforming) to achieve capacity is also 
determined. Our results subsume those for traditional MISO 
precoding with mean feedback. 

I. Introduction 

I Wireless relay systems have been a recent subject of in- 
tensive research due to their potential of providing increased 
diversity, extended coverage, or flexibility in compromising 
system performance and complexity/power consumption. Effi- 
] cient relaying protocols have been developed, among which 
t the most popular are the amplify-and-forward (AF) and 
decode-and-forward protocols [1]. 

To further enhance system performance, multiple antennas 
are deployed on one or more nodes of cooperative relay 
networks [2] -[6]. However, in multi-antenna systems, the opti- 
mum system structure and the resultant performance (capacity, 
error rate, mean-square error, etc.) depend heavily on the na- 
ture of channels and the channel state information (CSI) avail- 
able at the transceiver (see [7]-[13] and references therein). 
Therefore, research on multi-antenna relay communications 
has progressed from early work assuming perfect CSI [3] [4] 
to recent studies assuming more realistic partial CSI [5]. 

Consider a half-duplex AF relay link with M antennas 
(M > 2) at the source and a single antenna at both the 
relay and the destination. This scenario typically occurs when 
a traditional multiple-input single-output (MISO) link (without 
relaying) is obstructed, and then a relay is used to maintain 
the link and coverage. We assume that the destination has full 
knowledge of the source-relay and relay-destination channels. 



whereas the relay and the source are only aware of the long- 
term statistics of the source-relay channel. Relaying without 
instantaneous CSI is commonly referred to as noncoherent 
relaying. 

The above scenario has been considered in [5], where the 
source and the relay are assumed to have the long-term covari- 
ance information of a rapidly changing source-relay channel, 
and the optimum source covariance to maximize the ergodic 
capacity is determined. However, when the mean information^ 
of a slowly varying source-relay channel is available at the 
source, the capacity-achieving transmission strategy remains 
as an open problem and is tackled in this work. Mean feedback 
models the uncertainty in CSI due to channel estimation errors, 
quantization errors or imperfectness of the feedback link, and 
is feasible in slowly varying fading channels [8]. 

Channel mean and covariance feedback generally require 
different treatments, as they constrain the solution differently. 
Regarding utilizing mean feedback for maximum ergodic ca- 
pacity in traditional MISO, the optimization method in [9] [10] 
is based on calculus of variations using the Frechet differential, 
whereas in [11], the expression for ergodic capacity is first 
obtained and then ordinary calculus is used for optimization. 
Both methods rely on the concavity of the objective function in 
transmit covariance [9]-[ll]. However, such concavity cannot 
be established in our case, and these methods do not apply. 

We, therefore, take a nonconventional powerful approach 
based on one of the stochastic orders, the Laplace transform 
order [17], which circumvents the requirement of concavity. 
Optimum source precoding matrix (or equivalently, the covari- 
ance matrix) is determined. A special case of precoding, i.e., 
beamforming, occurs when the covariance matrix is of rank 
one, which has appealing reduced complexity but may not 
always be capacity-optimum. Here we derive the necessary 
and sufficient condition for it to achieve capacity. All our 
results subsume as a special case those for traditional MISO 
precoding with channel mean feedback. 

II. System model and problem statement 

We focus on a half-duplex AF link with M antennas at the 
source and with a single antenna at both the relay and the 

'Channel covariance (mean) information at the transmitter is also referred 
to as channel covariance (mean) feedback [7]. 



destination,^ resulting in a MISO source-relay link (backward 
channel) and a single-input single-output relay-destination link 
(forward channel). No direct source-destination link exists. 
The backward channel hs is modeled as [9]: 



(1) 



where h^, represents the scattering and is distributed as 
-A/'c(0,Im) (circularly symmetric complex Gaussian), and a is 
a nonnegative scaling constant (a > 0). Only the knowledge 
of the source-relay channel mean /x and a is provided to the 
source, which we refer to as channel mean feedback (at the 
source). In the first time slot, the received signal in the source- 
relay Unk is given by 

ri = h^Fx -I- m, 

where x is the source signal with Ejxx'''} = and ni is 

the noise at the relay distributed as ^4(0, 1). The precoding 
(shaping) matrix F is related to the transmit covariance matrix 
Q through Q = -pFF^. A transmit power constraint is 
imposed on the source: E{tr(Fx(Fx)''')} = 7tr(Q) < 7, 
which yields the constraint on the transmit covariance matrix: 
tr(Q) < 1- Denote the amplifying factor of the relay as rj. Due 
to noncoherent relaying, a long-term power constraint, G, is 
imposed on the relay, i.e., 

C? = E{|r?ri|2} = ?72[7jutQ/i + 7atr(Q) + 1]. (2) 



Thus, 77 = a/G/ {1 7[AitQ;i -|- atr(Q)]}. In (2), we have 
used: E{h]^Qhti,,} = tr(Q), since h.^ is A/'c(0,Im). In the 
second time slot, the received signal in the relay-destination 
link is given by 

r2=r]hF n +n2, 

where hp is the forward channel coefficient, and n2 is the 
noise at the destination with distribution A/'c(0, 1). 

Perfect knowledge of hs and hp is assumed at the desti- 
nation. No knowledge about hp is available at the source or 
relay. The ergodic capacity of the above relay channel is given 
below [5] 



max ^Ehs, hp 

Q^O 2 
tr(Q)<l 



log 



1 



r]^j\hF\^hlQhB 



v'\hF\ 



(3) 



where ^ is due to the half-duplex assumption. The log function 
here denotes natural logarithm, and thus the unit is nats per 
channel use. After substituting rj into (3) and applying a 
technique used in [21, Appendix A], the ergodic capacity 
under mean feedback can be further shown equivalent to: 



max 

Q^O, tr(Q) = 



C(Q) 



(4) 



where 



c(Q) = ^EhB, 



log 



G|fefph^QhB 



1 + 



(5) 



The analysis in this paper can be extended to the case when the relay has 
multiple antennas. Details are not discussed due to space constraint. 



Our goal is to find the optimum Q for the problem defined 
by (4)-(5). The main challenge here is that in (5), the log 
function inside the expectation operator is nonconvex in Q. 
Previous methods for MISO precoding with channel mean 
feedback [9]-[ll], which generally utiUze the concavity of 
the objective function in the transmit covariance matrix, are 
not applicable here. Below we employ a new method based 
on the Laplace transform order [17]. Some mathematical 
preliminaries pertaining to the Laplace transform order are 
given in Appendix I. 

III. Optimum Source Covariance Matrix 

Theorem 1 The optimum Q for the problem (4) is given 
by Qopt = V$V^ where 



^ [Wll/^ll 
diag|(?i 



V2 
1 - 



M -V'"' M -1 



(6) 
(7) 



and V2 . . . vm are arbitrary orthonormal vectors orthogonal to 
/X in C^. 

Proof: Due to space constraint, we only provide a detailed 
outline here. A complete proof can be found in [14]. 

To ease the presentation, we first sketch the basic idea. A 
globally optimum Q exists for (4) due to the Weierstrass' 
Theorem [19]. We divide the feasible set into two subsets, 
represented by Qi and Q2. Specifically, denote as Qi a 
feasible but otherwise arbitrary covariance matrix, which has 
none of its eigenvectors aligned with /x. Let Q2 have the same 
eigenvectors as Qopt [see (6)]. Here Q2 is customized for 
the proof. Our goal is to show that given any Qi, by proper 
power allocation in Q2, we can always have C(Q2) > C(Qi). 
We will show that to maximize C(Q2), equal power must be 
allocated to V2 . . . vm as in (7). We will also show that Q2 
with its optimum power allocation achieves at least the same 
ergodic capacity as Qi does. The optimality of Qopt can then 
be established. Below the proof starts. 

To facilitate subsequent comparison, our first step is to 
exploit the eigen-structures of covariance matrices. Let Qi 
be eigen-decomposed as Qi = UAU^ with 

(8) 



U = [ui . . . um], A = diag{Ai, . . . , Am}- 

Here {ui . . . um} is an arbitrary orthonormal basis in 
among which none is aligned with /Lt, and Yl!iLi — 
Q2 have the same eigenvectors as given in (6), i.e., 

M 

Q2=V*Vt, * = diag{0i,(^2,...,<^M}, ^<^i = l. (9) 

i=l 

Also define /3 = . . . PmV = U^. Clearly, = \\j3\\. 
From (8) and (1), we obtain /lz^Qi/lx = X^^^ A,|/3,p, and 

h^QihB = [/3 + x/^h^]tA[/3 + ^/Sh^] = aWqu 



where h„ 



\T A 



U^h,„ has the same distribu- 



(hwi ■ ■ ■ hwAi) 

tion as h.^, \hwi + is a noncentral chi-square random 

variable of two degrees of freedom with the noncentrality 



parameter given by \f3i\^ /a, for all i [16, p. 43], and Wq\ = 
T.ti + Similarly, /xtQ^/x = and 

= [(||/x|| . . . 0) + x/S hi,] * [(||/x|| . . . 0)^ + x/S h^] 

= aWQ2, 



p. Here h„ 



/ijum)^ — V^h^ has the same distribution as h^,, and 



where VFq2 + 

thus \hwi\^ is distributed as central chi-square of two degrees 
of freedom (or simply, exponential), i = 2, . . . , M. 

Now, within the subset represented by Q2, it can be shown 
that, given any Q <(f>i <l, among all * matrices [see (9)], 



diag{(^ 



1 



1, 



M- r 



M- 



maximizes C(Q2); i.e., to maximize C(Q2), equal power jjzj 
must be allocated to V2 . . . vm [14]. Thus, we denote 



Wq2 = $1 



m 

\fOL 



M 



M 



Up to now, we have 



C(Qi) = -Eh«,/.^{log[l + hWQi]}, 



C(Q2 



-Eh«,/.^{l0g[l + fc2T^Q2]}, 



where optimum equal power allocation among V2 . 
used in Q2, 



k2 



G\hF\^a 



7 



(11) 

(12) 

(13) 
• Vm is 

(14) 
(15) 



Note that (12) and (13) differ not only in Wqi and Wq2, but 
also in fci and ^2, making further comparison prohibitively 
difficult. To proceed, we choose the only free (unspecified) 
parameter in Q2, i.e., as follows: 



M 



(16) 



" \m' 

such that fci = ^2 = fc > in (14) and (15). Since < 

mini K < — ^^'^i ■'^i — 1' (^6) is always valid. The 

choice of (pi in (16) is crucial and will be shown to enable 
the final comparison. Naturally, our next step is to show that 

Et, 



.Wp,{log(l + kWQi)} < Jlog(l + kWQ2)}, 



(17) 



when hp is given and 0i is chosen as per (16). 

A straightforward method to show the above is to calculate 
the expectations on both sides of the inequality. The difficulty 
here is that the calculation involves the probability density 
function (p.d.f.) of a convex combination of M non-central 
chi-square random variables, which is too complicated to serve 
our purpose [15]. On the other hand, the Laplace transform 



of this p.d.f. does possess a more elegant structure [16]. If we 
can avoid the p.d.f. and use its Laplace transform instead, we 
will be able to overcome the difficulty. It turns out that Lemma 
1 in Appendix I is the precise tool we need here. 
Based on Lemma 1, to show (17), it suffices to show that 

Wqi <lt Wq2, subject to (16). 

Let A4wqi{s) [A4^^^^{s)] be the Laplace transform of the 
p.d.f. of Wqi (Wq2)- According to Definition 1 (see Appendix 
I), it is equivalent to show that -^^^^^{8) < MwQi{s),ys > 
0, subject to (16), or, 

log [M,y(.s)/Mw,,^{s)] < 0, Vs > 0, under (16). (18) 



(10) It can be shown that [16, p. 43] 



log 

J{s) = log 



M^^^{s)/Mwq^{s)\ = J{s) - -n{s), 
{1 + \is)...{1 + \ms) 



(1 



n{s) 



4>i\m\ 

1 + ^is 



01 s) 
M 



M-l*y 



M-1 



i=l 



(19) 



(20) 



To show that J{s) < for aU s > 0, note that J^Zi log(l + 
Us) is a Schur-concave function in t = (ii . . AmY , for all 
s > 0, and subject to (16), 



1 



M-1 M-1 



-< (Ai . . . Am) : 



i.e., the left-hand side is majorized by the right-hand 
side [14][18]. We can also show that 7^(s) > 0, Vs > 0, 
by repeatedly using (16). Thus, (18) holds, which implies that 
(17) holds. Since the construction of Q2 involves only (6), 
(10) and (16), none of which depends on hp, and (17) holds 
for any hp, obtain 

C(Qi) = ^E^F {lEvyQi{log(l + kWQi)}} 



< 



^Eh, {e,,^,^^ Jlog(l + kWQ2)}} = C(Q2). (21) 



Based on the arbitrariness of Qi and (v2 . . . vm), we conclude 
that the optimum solution, as it exists, must have the same 
eigen-structure as Q2. Also, as seen in the proof, equal power 
allocation among V2...vm is necessary for optimal! ty. At 
this moment, the only parameter in Q2 available for further 
optimization is (pi. Though (pi chosen as in (16) is sufficient to 
guarantee (21) for a specific Qi, it can be potentially further 
optimized to obtain (p as in Qopt- Therefore, for (4), Qopt 
[see (6)-(7)] is the optimum with (p numerically optimized 
according to the fading statistics of and hp- O 

Remark 1 The fading distribution of the relay-destination 

channel hp has no effect on the optimum transmit directions 
(eigenvectors of the covariance matrix) at the source. However, 



it does affect the optimum value of 
by solving: maxo<0<i iE{log[l + 



(j>. In fact, (j) is determined 

G|/iFpac(0)]}, where 



2 I ^wi I 



1+G|/if|^ 



This problem can be readily solved using one-dimensional 
search methods [19]. It is also interesting to see that the 
capacity depends on fi only through its Euclidean length 

Remark 2 When the relay power G oo, (4) be- 
comes; maxq^o, ti(Q)=i 5lEhB{log[l + 7h^QhB]}, which is 
the same mathematical problem as in [9, Theorem 3.1]. Thus, 
our result subsumes as a special case the optimum (traditional) 
MISO precoding with channel mean feedback, and it is not 
surprising to see the result in Theorem 1 and that in [9, 
Theorem 3.1] share the same structure.^ In particular, our proof 
here can also serve to prove [9, Theorem 3.1]. 

IV. Optimality of Beamforming along /X 

Beamforming along the source-relay channel mean is 
optimum if and only if all the source transmit power is 
allocated to and thus 0=1 and Qopt is rank-one. 

Theorem 2 Assume that hp is distributed as A/'c(0, 1).'* 
Given 7, G, /x and a, beamforming in the direction of /x can 
achieve capacity if and only if 

E{Z}+^E{Zcxp(Z)r(0, Z)} 



<E{Z2cxp(Z)r(0,Z)} + D2, (22) 



where the expectation is taken with respect to the random 
variable Z with the following probability density function 



Pz{z) 



_ Di 



■ exp 



IMII 



1 

Q!7 



- 1 



x/o((2||/x||VGD7^PT)/(aV7)) ; < z < Di 



In the above, we have defined Di = {a'y + 1 + j\\fi\\'^)/G 



and D2 = 



A Di 
a7+l 



1 



711/^11 



r(a, x) 



IS 



^exp(I?i)r(0,I?i 
the complementary incomplete Gamma function [20, Eqs 
(6.5.3), (6.5.15)], and /o(a;) is the zeroth-order modified Bessel 
function of the first kind [20, Eqs. (9.6.10), (9.6.16)]. 

Proof : Due to space limitation, the proof is omit- 
ted. Numerical methods are required to evaluate E{Z}, 
E{Zexp{Z}r(0,Z)} and E{Z^ cxp{Z}T{0, Z)}. □ 

Remark 3 When G — > 00, (22) coincides with [10, 
Theorem 4, nji — 1]. 

V. Numerical Examples 

We now provide simulation results to corroborate the 
analytical results. We choose the number of antennas at the 
source M to be 2. Fig. 1 shows capacity versus 7. Since 
the noise power is normalized to one, 7 denotes the transmit 

^Similar observations have also been reported in [5] with channel covari- 
ance feedback. 

''Note that the result in Theorem 1 holds with any fading distribution of 
hp . However, the condition for beamforming to be optimum does depend on 
the distribution of hp. 




Y(dB) 



Fig. 1. Comparison of optimum and sub-optimum solutions for M = 
a = 0.1, fi = (0.3518 -I- jO.2496 - 0.4039 - jl.0437)^ (IImHV" 
14.3851), G = 15 dB- 



signal-to-noise ratio (SNR). Here the "optimum" refers to 
Qopt which achieves capacity, and the "sub-optimum" refers 
to a sub-optimum Qi (see the proof of Theorem 1) with 
numerically optimized power allocation. The optimality of 
Qopt is clearly shown in Fig. 1. At high 7, the difference 
between the rates using the optimum and the sub-optimum 
diminishes. From Fig. 2, capacity increases with with 
other parameters fixed. Similar observations can be made with 
different sets of parameters, which are not presented here due 
to space constraint. 

Fig. 3 gives simulation results to corroborate (22). Consider 
the case with two antennas at the source {M = 2). Given the 
parameters a, fi, and G, the optimum can be determined for 
a specific 7, and let f{-f) = E{Z} + -^E {Z cxp(Z)r(0, Z)}- 
(E {Z'^ exp(Z)r(0, Z)} + D2). According to (22), if beam- 
forming is optimum, then (f) — \ [or, 1 — = 0], and /(7) < 0. 
This consistency is clearly reflected in Fig. 3. 

VI. Concluding remarks 

The optimum source covariance matrix of a noncoherent 
half-duplex AF MISO relay channel has been determined 
with channel mean feedback at the source. We have used a 
new method based on the Laplace transform order of two 
nonnegative random variables. Our results subsume as an 
asymptotic case the optimum precoding for a traditional MISO 
link. The superiority of the optimum transmit strategy over 
sub-optimum ones has been shown by simulations and is seen 
more pronounced at low to medium transmit SNR. Necessary 
and sufficient condition for optimality of beamforming has 
also been derived. It is expected that the powerful Laplace 
transform ordering approach used in this paper will find many 
applications pertaining to stochastic optimization problems in 
wireless communications and signal processing [22]. 






H|f/a = 50 






H||^/a = 20 
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Hl|'/a = 10 




H||V = 2 
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Fig. 2. Illustration of the impact of on capacity for M = 2, a = 0.1, 
G = 15 dB. 




Fig. 3. Illustration of the validity of (22) for M = 2, a = 0.5, = 
(-0.2163 + j0.0627 - 0.8328 + i0.1438)^, G = 10 dB. The function 
/(7) is given in Section V. 



Appendix I 

We introduce the key mathematical elements of this paper. 
Definition 1 [17, p. 95] Let Ti and T2 be two nonnegative 



random variables such that E{ 



,-sTi 



} > E{e-^^2},Vs > 0. 



Then Ti is said to be smaller than T2 in the Laplace transform 
order, denoted by Ti <lt 72. 

Definition 2 [17, p. 96] A function q: K is said 

to be completely monotone if all its derivatives g*^"-' exist and 
> 0, for all x > and n = 0,1,2,... (all 
nonnegative integer values). 

Lemma 1 Let T\ and T2 be two nonnegative random 
variables, and let d be any positive constant (d > 0). If 
Ti <LT T2, then E {log(l + dTi)} < E {log(l + dT^)] . 

Proof: The proof involves Theorem 3.B.4 (a) (p. 97) and 
Eq. (3.B.2) (p. 96) of [17], and is based on the fact that log(l + 



dx), d > 0, is a positive function in x when a; > with its first- 
order derivative being completely monotone (see Definition 2). 
Details can be found in [14]. □ 
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